Compositions and methods for producing benzylisoquinoline alkaloids

ABSTRACT

The present invention relates to host cells that produce compounds that are characterized as benzylisoquinolines, as well as select precursors and intermediates thereof. The host cells comprise one, two or more heterologous coding sequences wherein each of the heterologous coding sequences encodes an enzyme involved in the metabolic pathway of a benzylisoquinoline, or its precursors or intermediates from a starting compound. The invention also relates to methods of producing the benzylisoquinoline, as well as select precursors and intermediates thereof by culturing the host cells under culture conditions that promote expression of the enzymes that produce the benzylisoquinoline or precursors or intermediates thereof.

CROSS-REFERENCE TO RELATED APPLICATIONS

The application claims priority to U.S. Provisional Application Nos. 60/852,954 filed 19 Oct. 2006 and 60/859,149, filed 15 Nov. 2006, both of which are incorporated by reference in their entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Part of the work performed during development of this invention utilized U.S. Government funds through National institutes of Health Grant No. GM077346. The U.S. Government has certain rights in this invention.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates to compositions and methods for producing benzylisoquinoline alkaloids (BIAs) or molecules involved in the production of BIAs. The compositions comprise host cells comprising at least one heterologous coding sequence that encodes for an enzyme or its equivalent that is involved in the BIA synthetic pathway.

2. Background of the Invention

Alkaloids are a diverse group of nitrogen-containing small molecules that are produced in plants, marine organisms, and microorganisms through complex biosynthetic pathways. These complex molecules exhibit a range of interesting pharmacological activities and have been used as antimalarials, anticancer agents, analgesics, and in treatment of parkinsonism, hypertension, and central nervous system disorders.

The benzylisoquinoline alkaloids (BIAs) are a family of alkaloid molecules with over 2,500 defined structures. The most common BIAs currently utilized as medicinal compounds are synthesized in the opium poppy and include the analgesics codeine and morphine. However, many intermediates in this pathway that do not accumulate to significant levels in plants are themselves pharmacologically active as analgesics, antimalarials, anticancer agents, and antimicrobial agents. Even for molecules that accumulate to high levels in plants, it would be advantageous to eliminate the rigorous extraction and purification procedures required to isolate these compounds.

Chemical synthesis of these types of molecules is normally a costly and time-consuming process, often requiring harsh process conditions, generating toxic waste streams, and resulting in low quantities of the chemicals. In addition, many structures are simply unattainable using traditional synthesis methods due to the number of chiral centers and reactive functional groups. Alternatively, the production of BIAs can be achieved at relatively low cost and high yields in a microbial host. This will allow for cost-effective large scale production of intermediate and end-product BIAs.

The inventors have developed methods and compounds for the production of complex BIAs and their intermediates. Specifically, one can generate these molecules by expressing cloned and synthetic cDNAs in the host organism such that precursor molecules naturally produced in yeast, specifically L-tyrosine, are converted to various BIA intermediates in these engineered strains through a series of specific reactions catalyzed by recombinant enzymes. Engineered yeast strains can also be used to convert more complex substrates into value-added BIA molecules using similar strategies. The novel technology developed is the production of this family of alkaloid molecules in yeast from simple precursor molecules and/or more complex substrates using yeast or another microorganism as a host for the production of these molecules. Various BIA intermediates will be produced in yeast and can be used directly for their pharmacological activities or they can be used as starting molecules for chemical synthesis modifications to place additional functional groups on these backbone molecules to alter their pharmacological activities. For instance, one important intermediate reticuline is a molecule from which a number of pharmacologically active molecules such as sanguinarine and codeine can be synthesized. In addition, host cells can be engineered to produce non-natural alkaloid derivatives by adding novel enzymatic conversion steps to the heterologous pathway or eliminating steps from the native or heterologous pathway.

Microbial biosynthesis enables green synthesis and the production of these molecules without extreme reaction conditions and toxic waste streams. Furthermore, many intermediates of interest do not accumulate in the native plant hosts, and studies have demonstrated that modifying expression of specific genes in this pathway in the native plant hosts in order to direct accumulation of specific intermediates often inactivates multiple enzymes in the pathway, prohibiting the rational engineering of plant strains to accumulate specific intermediates. Microbial biosynthesis also eliminates the need for rigorous extraction and purification procedures required to isolate target molecules from the native host.

SUMMARY OF THE INVENTION

The present invention relates to host cells that produce compounds classified as benzylisoquinoline alkaloids, as well as select precursors and intermediates thereof. The host cells comprise one, two or more heterologous coding sequences wherein each of the heterologous coding sequences encodes an enzyme involved in the metabolic pathway of a benzylisoquinoline, or its precursors or intermediates from a starting compound. The invention also relates to methods of producing the benzylisoquinoline, as well as select precursors and intermediates thereof by culturing the host cells under conditions that promote expression and activity of the necessary enzymes that produce the benzylisoquinoline or precursors or intermediates thereof, as well as optimize the growth rate of the host.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a synthetic pathway present in the host cells of the present invention. The pathway begins with tyrosine and ends with reticuline. The pathway can include fewer enzymes than those displayed if the desired end result is one of the intermediates in the tyrosine

reticuline pathway.

FIG. 2 depicts measurement of dopamine production from a culture of host cells of the present invention.

FIG. 3 depicts alternative pathways for dopamine and 4-HPA production. FIG. 3A shows pathways to produce dopamine from tyrosine through either tyramine or L-dopa and FIG. 3B shows pathways to product 4-HPA from tyrosine through either tyramine or 4-hydroxyphenylpyruvate.

FIG. 4 depicts measurement of norcoclaurine production from a culture of host cells of the present invention. In this particular culture, the cells possessed the NCS heterologous coding sequence and the growth media was supplemented with dopamine and 4-HPA. FIG. 4A shows the pathway of norcoclaurine production and FIG. 4B shows the chromatogram confirming production of norcoclaurine.

FIG. 5 depicts the synthetic pathway present in embodiments of host cells in the present invention. The pathway begins with norlaudanosoline and ends with reticuline. The pathway can include fewer enzymes than those displayed if the desired end result is one of the intermediates in the norlaudanosoline

reticuline pathway.

FIG. 6 depicts the measurement of intermediates in the norlaudanosoline

reticuline pathway. FIG. 6A shows the levels of methyl norlaudanosoline produced, and FIG. 6B shows the levels of methyl laudanosoline produced.

FIG. 7 depicts the measurement of reticuline from the host cells shown in FIG. 5.

FIG. 8 depicts the levels of substrate conversion obtained from host cells expressing different coding sequences available for the enzymes of the pathway. The data demonstrate that any combination of enzyme variants (obtained from different native host sources) will produce reticuline from the substrate. However, it is observed that certain combinations produce higher levels of reticuline than others.

FIG. 9 depicts measurement of levels of reticuline production when fed various amounts of the starting substrate.

FIG. 10 depicts measurement of levels of reticuline production from the supplied substrate at various points in the growth cycle of the host cells. The data demonstrate that the cells continue to produce and accumulate reticuline well into stationary phase, suggesting different fermentation strategies for maximizing reticuline production.

FIG. 11 depicts measurement of levels of reticuline production when the coding sequences for the heterologous enzymes are either integrated into the genome or expressed from plasmids. The data demonstrate that integration does not affect the level of accumulation of the desired BIA, confirming that the enzymes remain functional and expression is sufficient when integrated into the host genome.

FIG. 12 depicts the synthetic pathway present in embodiments of host cells in the present invention. Although the pathway may be longer, starting from tyrosine or norlaudanosoline as shown in other figures, this particular pathway begins with reticuline and ends with either laudanine or canadine. The pathway can include fewer enzymes than those displayed if the desired end result is one of the intermediates in the norlaudanosoline

canadine pathway.

FIG. 13 depicts the measurement of intermediates in the reticuline

canadine pathway. FIG. 13A shows the level of canadine produced, FIG. 13B shows the level of tetrahydrocolumbamine produced, FIG. 13C shows the level of scoulerine produced, and FIG. 13D shows the level of reticuline produced. Characteristic MS/MS fragmentation patterns are also shown for each ion.

FIG. 14 depicts a synthetic pathway from reticuline to thebaine. Note that the conversion of salutaridinol-7-O-acetate to thebaine is spontaneous, thus not requiring additional enzymatic steps.

FIG. 15 depicts salutaridine production in host cells comprising heterologous sequences coding for 6OMT, CNMT, 4′OMT, yCPR1 and yCYP2D6. The pathway synthesizes salutaridine when the cells are fed laudanosoline. The characteristic MS/MS fragmentation pattern is also shown for this ion.

FIG. 16 depicts exemplary combination and subcombination pathways of the present invention. FIGS. 16A-C depict exemplary overall combination pathways and FIGS. 16D-I depict the subcombination pathways including the chemical species and enzymes involved in the pathways. Pathway designations in FIGS. 16D-I refer to the pathway designations in FIGS. 16A-C. FIG. 16J is but one embodiment of the methods of the present invention that combines a few of the subcombination pathways to produce a BIA.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to compositions and methods for producing benzylisoquinoline alkaloids (BIAs). In particular, the invention relates to host cells that have been genetically engineered to express recombinant and/or have altered expression of endogenous enzymes involved in the biosynthesis of BIAs and their intermediates and derivatives.

In one embodiment, the cells of the present invention are non-plant cells. In a more particular embodiment, the cells are insect cells, mammalian cells, bacterial cells or yeast cells. Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such as Bacillus subtilis, Escherichia coli Streptomyces and Salmonella typhimuium cells and insect cells such as Drosophila S2 and Spodoptera Sf9 cells. In one specific embodiment, the cells are yeast cells or E. coli cells. In a more specific embodiment, the yeast cells can be of the species Saccharomyces cerevisiae (S. cerevisiae). Yeast is also an ideal host cell because cytochrome P450 proteins, which are involved in certain steps in the synthetic pathways, are able to fold properly into the endoplasmic reticulum membrane so that activity is maintained, as opposed to bacterial cells which lack such intracellular compartments. Examples of yeast strains that can be used in the invention include, but are not limited to, S288C, W303, D273-10B, X2180, A364A, Σ1278B, AB972, SK1 and FL100. In specific examples, the yeast strain is any of S288C (MATα; SUC2 mal mel gal2 CUP1 flo1 flo8-1 hap1), BY4741 (MATa; his3Δ1; leu2Δ0; met15Δ0; ura3Δ0), BY4742 (MATα; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0), BY4743 (MATa/MATα; his3Δ1/his3Δ1; leu2Δ0/leu2Δ0; met15Δ0/MET15; LYS2/lys2Δ0; ura3Δ0/ura3Δ0), and WAT11 or W(R), derivatives of the W303-B strain (MATa; ade2-1; his3-11, -15; leu2-3,-112; ura3-1; canR; cyr+) which express the Arabidopsis thaliana NADPH-P450 reductase ATR1 and the yeast NADPH-P450 reductase CPR1, respectively. In another specific embodiment, the particular strain of yeast cell is W303α (MATα; his3-11,15 trp1-1 leu2-3 ura3-1 ade2-1), which is commercially available. The identity and genotype of additional examples of yeast strains can be found at EUROSCARF, available through the World Wide Web at web.uni-frankfurt.de/fb15/mikro/euroscarf/col_index.html.

Other example of cells that can serve as host cells are included, but not limited to, the strains listed in the table below.

TABLE I EUROSCARF/Open Biosystems CSV Accession # ORF deleted Gene Strain Background 3 n/a wild type W303; Mat α; his3-11, 15 trp1-1 leu2-3 ura3-1 ade2-1 142 n/a YER073w/YPL061w ALD5/ALD6 W303; Mat α; his3-11, 15 trp1-1 leu2-3 ura3-1 ade2-1 152 Y10753 YMR170c ALD2 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 153 Y10752 YMR169c ALD3 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 154 Y11671 YOR374w ALD4 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 155 Y10213 YER073w ALD5 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 156 Y12767 YPL061w ALD6 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 157 Y16510 YML110c COQ5 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 158 Y16246 YOL096c COQ3 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 159 Y13675 YDR316w OMS1 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 160 30701B YGR001c CVDM003-01A BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 161 Y11457 YIL064w BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 162 Y15719 YBR271w BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 163 B0006B YJR129c CEN.EN2-1B BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 164 B0199B YNL024c CEN.HE27-2C BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 165 Y12984 YNL092w BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 166 Y12811 YPL017c BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 167 Y12903 YHR209w BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 417 16236 YOL086C ADH1 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 418 10891 YMR303C ADH2 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 419 16217 YMR083W ADH3 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 420 14623 YGL256W ADH4 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 421 13284 YBR145W ADH5 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 422 16460 YMR318C ADH6 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 423 15821 YCR105W ADH7 BY4742; Mat α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0; 151 Y10000 wild type BY4742 MAT α; his3Δ1; leu2Δ0; lys2Δ0; ura3Δ0

The cells can be in any environment, provided the cells are able to express functional heterologous enzymes. In particular, the cells can be used in either in vitro or in vivo experiments. To be clear, in vitro, as used in the present invention, simply means outside of a living cell, regardless of the location of the cell. The term in vivo, on the other hand, indicates inside a cell, regardless of the location of the cell. In one embodiment, the cells are cultured under conditions that are conducive to enzyme expression and with appropriate substrates available to allow production of BIAs in vivo. Alternatively, the functional enzymes can be extracted from the host for production of BIAs under in vitro conditions. In another embodiment, the host cells can be placed back into a multicellular host organism. The host cells can be in any phase of growth, such as, but not limited to, stationary phase and log-growth phase, etc. In addition, the cultures themselves may be continuous cultures or they may be batch cultures.

The cell culture conditions for a particular cell type are well-known in the art and need not be repeated herein. In one particular embodiment, the host cells that comprise the various heterologous coding sequences can be cultured under standard or readily optimized conditions, with standard cell culture media and supplements. As one example, standard growth media when selective pressure for plasmid maintenance is not required may contain 20 g/L yeast extract, 10 g/L peptone, and 20 g/L dextrose (YPD). Host cells containing plasmids can be grown in synthetic complete (SC) media containing 1.7 g/L yeast nitrogen base, 5 g/L ammonium sulfate, and 20 g/L dextrose supplemented with the appropriate amino acids required for growth and selection. Alternative carbon sources which may be useful for inducible enzyme expression include sucrose, raffinose, and galactose. Cells can be grown at 30° C. with shaking at 200 rpm, typically in test tubes or flasks in volumes ranging from 1-1000 mL, or larger, in the laboratory. Culture volumes can also be scaled up for growth in larger fermentation vessels, for example, as part of an industrial process.

The term “host cells,” as used in the present invention, are cells that harbor the heterologous coding sequences of the present invention. The heterologous coding sequences could be integrated stably into the genome of the host cells, or the heterologous coding sequences can be transiently inserted into the host cell. As used herein, the term “heterologous coding sequence” is used to indicate any polynucleotide that codes for, or ultimately codes for, a peptide or protein or its equivalent amino acid sequence, e.g., an enzyme, that is not normally present in the host organism and can be expressed in the host cell under proper conditions. As such, “heterologous coding sequences” includes additional copies of coding sequences that are normally present in the host cell, such that the cell is expressing additional copies of a coding sequence that are not normally present in the cells. The heterologous coding sequences can be RNA or any type thereof, e.g., mRNA, DNA or any type thereof, e.g., cDNA, or a hybrid of RNA/DNA. Examples of coding sequences include, but are not limited to, full-length transcription units that comprise such features as the coding sequence, introns, promoter regions, 3′-UTRs and enhancer regions.

“Heterologous coding sequences” also includes the coding portion of the peptide or enzyme, i.e., the cDNA or mRNA sequence, of the peptide or enzyme, as well as the coding portion of the full-length transcriptional unit, i.e., the gene comprising introns and exons, as well as “codon optimized” sequences, truncated sequences or other forms of altered sequences that code for the enzyme or code for its equivalent amino acid sequence, provided that the equivalent amino acid sequence produces a functional protein. Such equivalent amino acid sequences can have a deletion of one or more amino acids, with the deletion being N-terminal, C-terminal or internal. Truncated forms are envisioned as long as they have the catalytic capability indicated herein. Fusions of two or more enzymes are also envisioned to facilitate the transfer of metabolites in the pathway, provided again that catalytic activities are maintained.

Operable fragments, mutants or truncated forms may be identified by modeling and/or screening. This is made possible by deletion of, for example, N-terminal, C-terminal or internal regions of the protein in a step-wise fashion, followed by analysis of the resulting derivative with regard to its activity for the desired reaction compared to the original sequence. If the derivative in question operates in this capacity, it is considered to constitute an equivalent derivative of the enzyme proper.

Codon optimization is a well-known technique for optimizing the expression of heterologous polynucleotides in host cells and is reviewed in Gustafsson, C. et al, Trends Biotechnol, 22:346-353 (2004), which is incorporated by reference in its entirety.

The present invention also relates to heterologous coding sequences that code for amino acid sequences that are equivalent to the native amino acid sequences for the various enzymes. An amino acid sequence that is “equivalent” is defined as an amino acid sequence that is not identical to the specific amino acid sequence, but rather contains at least some amino acid changes (deletions, substitutions, inversions, insertions, etc.) that do not essentially affect the biological activity of the protein as compared to a similar activity of the specific amino acid sequence, when used for a desired purpose. The biological activity refers to, in the example of a decarboxylase, its catalytic activity. Equivalent sequences are also meant to include those which have been engineered and/or evolved to have properties different from the original amino acid sequence. Examples of mutable properties include catalytic activity, substrate specificity, selectivity, stability, solubility, localization, etc. In specific embodiments, an “equivalent” amino acid sequence contains at least 80%-99% identity at the amino acid level to the specific amino acid sequence, in particular at least about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94% and more in particular, at least 95%, 96%, 97%, 98% and 99% identity, at the amino acid level. In some cases, the amino acid sequence may be identical but the DNA sequence is altered such as to optimize codon usage for the host organism, for example.

The host cells may also be modified to possess one or more genetic alterations to accommodate the heterologous coding sequences. Alterations of the native host genome include, but are not limited to, modifying the genome to reduce or ablate expression of a specific enzyme that may interfere with the desired pathway. The presence of such native enzymes may rapidly convert one of the intermediates or final products of the pathway into a metabolite or other compound that is not usable in the desired pathway. Thus, if the activity of the native enzyme were reduced or altogether absent, the produced intermediates would be more readily available for incorporation into the desired product. For example, if the host cell is a yeast cell and the desired pathway produces 4-HPA from tyrosine, or a downstream metabolite thereof, it may be beneficial to reduce or ablate expression of the native endogenous alcohol and/or aldehyde dehydrogenase enzymes, which could convert the desired final product (4-HPA) into tyrosol or 4-hydroxyphenylacetic acid, respectively. Genetic alterations may also include modifying the promoters of endogenous genes to increase expression and/or introducing additional copies of endogenous genes. Examples of this include the construction/use of strains which overexpress the endogenous yeast NADPH-P450 reductase CPR1 to increase activity of heterologous P450 enzymes. In addition, endogenous enzymes such as ARO8, 9, and 10, which are directly involved in the synthesis of intermediate metabolites, may also be overexpressed.

The heterologous coding sequences of the present invention are sequences that encode enzymes, either wild-type or equivalent sequences, that are normally responsible for the production of BIAs in plants. The enzymes for which the heterologous sequences will code can be any of the enzymes in the BIA pathway, and can be from any known source. For example, Norcoclaurine synthase (NCS; EC 4.2.1.78) is found in at least Thalictrum flavum, Papaver somniferum, and Coptis japonica and is known to catalyze the condensation reaction of dopamine and 4-hydroxyphenylacetaldehyde (4-HPA) to form the trihydroxylated alkaloid (S)-norcoclaurine, which is widely accepted as the first “committed” step in the production of BIAs in plants. The choice and number of enzymes encoded by the heterologous coding sequences for the particular synthetic pathway should be chosen based upon the desired product. For example, the host cells of the present invention may comprise at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more heterologous coding sequences.

With regard to the heterologous coding sequences, the sequences are as reported in GENBANK unless otherwise noted. For example, the codon-optimized CYP2D6 sequence is included for reference along with the human monoamine oxidase A sequence with the first 10 amino acids optimized to facilitate translation initiation and proper folding in yeast.

A non-exhaustive list of enzymes that are contemplated in the present invention is shown in the table below. The host cells of the present invention may comprise any combination of the listed enzymes, from any source. Unless otherwise indicated, Accession numbers in Table I refer to GenBank. Some accession numbers refer to the Saccharomyces genome database (SGD) which is available on the world-wide web at www.yeastgenome.org.

TABLE II Example Source Organism Enzyme Name Abbrev. (Accession #) Reference Catalyzed Reactions L-tyrosine/dopa TYDC1 P. somniferum Facchini, P. J. and De L-tyrosine-

L- decarboxylase 1 U08597 Luca, V. J. Biol. tyramine, T. flavum Chem. 269 (43), L-dopa-

L-dopamine AF314150 26684-26690 (1994). PUBMED 7929401 L-tyrosine/dopa TYDC2 P. somniferum Facchini, P. J. and De L-tyrosine-

L- decarboxylase 2 U08598 Luca, V. J. Biol. tyramine, V. vinifera Chem. 269 (43), L-dopa

L-dopamine AM429650 26684-26690 (1994). PUBMED 7929401 Cytochrome P450 CYP2D6 H. sapiens Hiroi, T, Imaoka, S, L-tyramine

L- 2D6 NM000106 and Funae, Y. dopamine S. cerevisiae Biochem Biophys Res (R)-reticuline

(R)- codon- Commun. 1998 Aug salutaridine optimized 28; 249(3): 838-43. codeine

morphine PUBMED 9731223 Zhu, W et. Al. The Journal of Immunology. Vol. 175, pp. 7357-7362 (2005). NADPH p450 CPR1 S. cerevisiae Turi, TG and Loper, L-tyramine-

L- reductase SGDID: JC. J Biol Chem. dopamine S000001084 1992 Jan (R)-reticuline

(R)- 25; 267(3): 2046-56. salutaridine PUBMED 1730736 codeine

morphine Polyphenyloxidase PPO A. bisporus Wichers, HJ et. Al. L-tyrosine

L-dopa X85112, Appl. Microbiol. AJ223816 Biotechnol. (2003) 61: 336-341. Tyrosine TyrH R. norvegicus Grima, B., L-tyrosine

L-dopa hydroxylase NM 012740 Lamouroux, A., Blanot, F., Biguet, N. F. and Mallet, J. Proc. Natl. Acad. Sci. U.S.A. 82 (2), 617-621 (1985). PUBMED 2857492 Tyrosine TH2 H. sapiens Grima, B., L-tyrosine

L-dopa hydroxylase NM 000240 Lamouroux, A., Boni, C., Julien, J. F., Javoy-Agid, F. and Mallet, J. Nature 326 (6114), 707-711 (1987). PUBMED 2882428 GTPcyclohydrolase I GTPCH1 H. sapiens Leff, SE et. al. Produces BH₄ cofactor NM 00161 Experimental for tyrosine Neurology 151, 249-64 hydroxylase reaction (1998). L-tyrosine

L-dopa Monoamine MaoA H. sapiens Bach, WJ et. al. Proc. L-tyramine

4-HPA oxidase A J03792 Natl. Acad Sci. USA. Vol. 85, pp. 4934-4938, July 1988. Monoamine maoA E. coli Azakami, H et. al. J. L-tyramine

4-HPA oxidase D2367 Ferment. Bioeng. 77, 315-319, 1994. Tyramine oxidase tynA K. aerogenes Cooper, RA. FEMS L-tyramine

4-HPA AB200269 Microbiol Lett. 1997 Jan 1; 146(1): 85-9. PUBMED 8997710 Aromatic amino ARO8 S. cerevisiae Iraqui, I et. al. Mol. L-tyrosine

4- acid transaminase SGDID: Gen. Genet. (1998) hydroxyphenylpyruvate S000003170 257: 238-248. Aromatic amino ARO9 S. cerevisiae Iraqui, I et. al. Mol. L-tyrosine

4- acid transaminase SGDID: Gen. Genet. (1998) hydroxyphenylpyruvate S000001179 257: 238-248. Phenylpyruvate ARO10 S. cerevisiae Vuralhan, Z. et. al. 4-hydroxyphenyl- decarboxylase SGDID: Appl. And Environ. pyruvate

4-HPA S000002788 Microbiol. Vol. 71, No. 6, p. 3276-3284. Norcoclaurine NCS T. flavum, Samanani, N, L-dopamine+4-HPA

synthase AY376412 Liscombe, DK, and (S)-norcoclaurine P. somniferum Facchini, P. The AY860500, Plant Journal (2004) AY860501 40, 302-313. Norcoclaurine 6- 6OMT T. flavum Ounaroon, A. et. al. (S)-norcoclaurine

O- AY61057 The Plant Journal (S)-coclaurine methyltransferase P. somniferum (2003) 36, 808-819. norlaudanosoline

6- AY268894 O-methyl norlaudanosoline laudanosoline

6-O- methyl laudanosoline Coclaurine-N- CNMT T. flavum Choi, Kum-Boo et. coclaurine

methyltransferase AY610508 al. J. Biol. Chem. N-methylcoclaurine, P. somniferum Vol. 277, No. 1, pp. laudanosoline

AY217336 830-835, 2002. N-methyl laudanosoline Cytochrome P450 CYP80B1 P. somniferum Pauli, HH and (S)-N- 80B1 AF191772 Kutchan, TM. Plant methylcoclaurine

J. 1998 (S)-3′-hydroxy-N- Mar; 13(6): 793-801. methylcoclaurine 4′-O- 4′OMT T. flavum Morishige, T. et. al. 3′-hydroxy-N- methyltransferase AY610510 J. Biol. Chem. Vol. methylcoclaurine

P. somniferum 275, No. 30, pp. reticuline AY217333, 23398-23405, 2000. norlaudanosoline

4′- AY217334 O-methyl norlaudanosoline laudanosoline

4′-O- methyl laudanosoline Berberine bridge BBE P. somniferum Facchini, P. J., (S)-reticuline

enzyme AF025430 Penzes, C., (S)-scoulerine Johnson, A. G. and Bull, D. Plant Physiol. 112 (4), 1669-1677 (1996). PUBMED 8972604 Reticuline 7-O- 7OMT P. somniferum Ounaroon, A. et. al. reticuline

methyltransferase AY268893 The Plant Journal laudanine (2003) 36, 808-819. Scoulerine 9-O- S9OMT T. flavum Samanani, N., (S)-scoulerine

methyltransferase AY610512 Park, S. U. and (S)- Facchini, P. J. Plant tetrahydrocolumbamine Cell 17 (3), 915-926 (2005). PUBMED 15722473 Canadine CYP719A T. flavum Samanani, N., (S)tetrahydrocolumbamine synthase AY610513 Park, S. U. and

(S)-canadine Facchini, P. J. Plant Cell 17 (3), 915-926 (2005). PUBMED 15722473 Ikezawa, N. et. al. J. Biol. Chem. Vol. 278, No. 40, pp. 38557-38565, 2003. NADPH P450 ATR1 A. thaliana Louerat-Oriou B, Reductase partner for reductase NM 118585 Perret A, Pompon D. cytochrome P450s Eur J Biochem. 1998 Ex. (S)tetrahydro- Dec 15; 258(3): 1040-9. columbamine

(S)- canadine Salutaridine SalR P. somniferum Ziegler, J. et. al. Plant salutaridine

salutaridinol reductase DQ316261 J. 48 (2), 177-192 (2006) Salutaridinol 7-O- SalAT P. somniferum Grothe, T., Lenz, R. salutaridinol

salutaridinol- acetyltransferase AF339913 and Kutchan, T. M. J. 7-O- Biol. Chem. 276 (33), acetate

thebaine 30717-30723 (2001). PUBMED 11404355 Codeine COR P. somniferum Unterlinner, B., codeinone

codeine reductase AF108432 Lenz, R. and Kutchan, T. M. Plant J. 18 (5), 465-475 (1999). PUBMED 1041769 Berbamunine CYP80A1 B. stolonifera Kraus, PF and 2 (R)-N- synthase U09610 Kutchan, TM. Proc methylcoclaurine

guattegaumerine Natl Acad Sci USA. (R)-N-methlcoclaurine + 1995 Mar (S)-N- 14; 92(6): 2071-5. methylcoclaurine

berbamunine

In one specific embodiment, the present invention relates to host cells that produce 4-Hydroxyphenylacetaldehyde (4-HPA) from tyrosine. For example, the host cells that produce 4-HPA from tyrosine comprise at least two heterologous coding sequences, wherein each of the heterologous coding sequences encodes a separate enzyme that is involved in the biosynthetic pathway that converts tyrosine to 4-HPA. In a more specific embodiment, the host cells that produce 4-HPA from tyrosine comprise L-tyrosine/dopa decarboxylase (TYDC, P. somniferum) and one of monoamine oxidase (maoA, E. coli or Homo sapiens) or Tyramine oxidase (tyn A, Klebsiella aerogenes). In another specific embodiment, the host cells that produce 4-HPA from tyrosine comprise Aromatic amino acid transaminase (ARO8/ARO9, S. cerevisiae) and Phenylpyruvate decarboxylase (ARO10, S. cerevisiae).

In another specific embodiment, the present invention relates to host cells that produce dopamine from tyrosine. For example, the host cells that produce dopamine comprise at least two heterologous coding sequences, wherein each of the heterologous coding sequences encodes a separate enzyme that is involved in the biosynthetic pathway that converts tyrosine to dopamine. In a more specific embodiment, the host cells that produce dopamine from tyrosine comprise a Tyrosine/dopa decarboxylase (TYDC, P. somniferum) and one of Cytochrome P450 2D6 (CYP2D6, H. sapiens) or Codon-Optimized Cytochrome P450 2D6 (CYP2D6, S. cerevisiae). To improve of the activity of CYP2D6, additional copies of the yeast NADPH-P450 reductase (yCPR1) may be expressed either from the chromosome or a plasmid. In another specific embodiment, the host cells that produce dopamine from tyrosine comprise a Tyrosine hydroxylase (PPO Agaricus bisporus; TH2, H. sapiens; TyrH, Rattus norvegicus) and Tyrosine/dopa decarboxylase (TYDC, P. somniferum).

In another specific embodiment, the present invention relates to host cells that convert tyrosine into norcoclaurine. Regardless of the source of the tyrosine starting material, the host cells of the present invention that produce norcoclaurine comprise at least three heterologous coding sequences, wherein each of the heterologous coding sequences encodes a separate enzyme, or its equivalent, that is involved in the biosynthetic pathway that converts tyrosine to norcoclaurine. In one specific embodiment, the host cells that produce norcoclaurine from tyrosine comprise the L-tyrosine/dopa decarboxylase (TYDC, P. somniferum), Monoamine oxidase (MaoA, E. coli), one of Cytochrome P450 2D6 (CYP2D6, H. sapiens) or Codon-Optimized Cytochrome P450 2D6 (CYP2D6, S. cerevisiae), and NCS (T. flavum or P. somniferum) coding sequences. To improve the activity of CYP2D6, additional copies of the endogenous yeast P450 NADPH reductase (yCPR1) may be expressed either from the chromosome or a plasmid. Of course, the embodiment above may further comprise additional heterologous coding sequences that will continue the synthetic pathway to create at least one additional metabolite. For example, the presence of the heterologous coding sequence that codes for Norcoclaurine 6-O-methyltransferase (6OMT, T. flavum, P. somniferum) will further metabolize norcoclaurine into coclaurine. Other pathways to/from norcoclaurine are depicted herein.

The embodiment above can serve as the basis of additional embodiments. For example, embodiments comprising TYDC, CYP2D6, maoA, NCS, 6OMT may further comprise the Coclaurine-N-methyltransferase (CNMT, T. flavum, P. somniferum) heterologous coding sequence, the embodiments of which may further comprise the Cytochrome P450 80B1 (CYP80B1, P. somniferum) heterologous coding sequence, the embodiments of which may further comprise the 4′-O-methyltransferase (4′OMT, T. flavum, P. somniferum) heterologous coding sequence, the embodiments of which may further comprise Berberine bridge enzyme (BBE, P. somniferum), etc. The embodiments in which the host cell comprises TYDC, CYP2D6, maoA, NCS, 6OMT and CNMT will generate N-methylcoclaurine ultimately from tyrosine. The embodiments in which the host cell comprises TYDC, CYP2D6, maoA, NCS, 6OMT, CNMT and CYP80B1 will generate 3′-Hydroxy-N-methylcoclaurine ultimately from tyrosine. The embodiments in which the host cell comprises TYDC, CYP2D6, maoA, NCS, 6OMT, CNMT CYP80B1 and 4′OMT will generate reticuline ultimately from tyrosine. The embodiments in which the host cell comprises TYDC, CYP2D6, maoA, NCS, 6OMT, CNMT CYP80B1, 4′OMT and BBE will generate scoulerine ultimately from tyrosine. All strains containing either CYP2D6 and/or CYP80B1 will likely require overexpression of CPR1 and/or ATR1 NADPH-P450 reductases for optimal activity.

Of course, the desired pathways need not start with tyrosine. For example, the synthetic pathways generated in the host cells may start with laudanosoline, methyl laudanosoline, norlaudanosoline, methyl norlaudanosoline, or another compound that may or may not be normally present in the endogenous BIA pathway. Thus, the starting material may be non-naturally occurring or the starting material may be naturally occurring. Additional examples of starting material include, but are not limited to, tyramine, dopamine, 4-HPA, 4-HPPA, norcoclaurine, coclaurine, N-methylcoclaurine, 3′-hydroxy-N-methylcoclaurine, reticuline, scoulerine, tetrahydrocolumbamine, canadine, laudanine, sanguinarine, morphine, codeine, codeinone and dimethyl tetrahydoisoquinoline, e.g., 6,7-dimethyl-1-2-3-4-tetrahydroisoquinoline. Other compounds may also be used as the starting material in the desired synthetic pathway and one of skill in the art would recognize the necessary starting material, based upon the synthetic pathway present in the host cell The source of the starting material may be from the host cell itself, e.g., tyrosine, or the starting material may be added or supplemented to the host cell from an outside source. For example, if the host cells are growing in liquid culture (an in vivo environment), the cell media may be supplemented with the starting material, e.g., tyrosine or norlaudanosoline, which is transported into the cells and converted into the desired products.

In one embodiment, the host cells of the claimed invention convert norlaudanosoline into reticuline. The norlaudanosoline may be generated through a normal or synthetic pathway in the same or different host cell, or the norlaudanosoline may be fed to the cells from the outside. In this particular embodiment, the host cells comprise 6OMT, CNMT and 4′OMT. This embodiment can serve as the basis of additional embodiments. For example, embodiments comprising 6OMT, CNMT and 4′OMT may further comprise BBE or Reticuline 7-O-methyltransferase (7OMT, P. somniferum) the embodiments of which may further comprise Scoulerine 9-O-methyltransferase (S9OMT, T. flavum), the embodiments of which may further comprise Canadine synthase (CYP719A, T. flavum). The embodiments that comprise 6OMT, CNMT and 4′OMT will generate reticuline from norlaudansoline. The embodiments that comprise 6OMT, CNMT, 4′OMT and BBE will generate scoulerine from norlaudanosoline. The embodiments that comprise 6OMT, CNMT, 4′OMT, and CYP2D6 with its reductase partner (CPR1 or ATR1) will generate salutaridine from norlaudanosoline. The embodiments that comprise 6OMT, CNMT, 4′OMT and 7OMT will generate laudanine from norlaudanosoline. The embodiments that comprise 6OMT, CNMT, 4′OMT, BBE and S9OMT will generate tetrahydrocolumbamine from norlaudanosoline. The embodiments that comprise 6OMT, CNMT, 4′OMT, BBE, S9OMT and CYP719A with its reductase partner ATR1 will generate canadine from norlaudanosoline.

The following is a non-exhaustive list of exemplary host organisms comprising heterologous coding sequences. The list is not intended to limit the scope of the invention in any way.

TABLE III Strain Background Plasmid(s) Enzyme(s) CSY73 W303α P_(ARO9)::TEF, P_(ARO10)::TEF CSY87 W303α pCS251 P_(TEF1)-AbPPO2 CSY88 W303α pCS251, pCS221 P_(TEF1)-AbPPO2, P_(TEF1)-TYDC2 CSY94 W303α pCS250, pCS283 P_(TEF1)-TfNCSΔ10, P_(TEF1)-TYDC2, P_(TEF1)-maoA CSY95 W303α ChrIV 122460::P_(TEF1)-TYDC2 CSY104 CSY95 ChrV 1100::P_(TEF1)-maoA CSY107 W303α pCS222, pCS330 P_(tetO7)-yCPR1, P_(TEF1)-TYDC2, P_(TEF1)-yCYP2D6 CSY116 CSY104 his3::P_(GPD)-yCPR1 CSY176 W303α his3::P_(GAL1)-TfNCS CSY177 W303α his3::P_(GAL1)-Tf6OMT CSY178 W303α his3::P_(GAL1)-PsNCS2 CSY179 W303α his3::P_(GAL1)-Ps6OMT CSY234 CSY194 pCS330 P_(TEF1)-TYDC2, P_(TEF1)-yCYP2D6 W(R) CSY235 CSY194 pCS222, pCS330 P_(tetO7)-yCPR1, P_(TEF1)-TYDC2, P_(TEF1)-yCYP2D6 W(R) CSY307 W303α pCS827, pCS830 P_(TEF1)-Ps6OMT, P_(TEF1)-PsCNMT, P_(TEF1)-Ps4′OMT CSY308 W303α pCS828, pCS830 P_(TEF1)-Ps6OMT, P_(TEF1)-TfCNMT, P_(TEF1)-Ps4′OMT CSY309 W303α pCS829, pCS830 P_(TEF1)-Tf6OMT, P_(TEF1)-PsCNMT, P_(TEF1)-Ps4′OMT CSY310 W303α pCS772, pCS830 P_(TEF1)-Tf6OMT, P_(TEF1)-TfCNMT, P_(TEF1)-Ps4′OMT CSY311 W303α pCS827, pCS831 P_(TEF1)-Ps6OMT, P_(TEF1)-PsCNMT, P_(TEF1)-Tf4′OMT CSY312 W303α pCS828, pCS831 P_(TEF1)-Ps6OMT, P_(TEF1)-TfCNMT, P_(TEF1)-Tf4′OMT CSY313 W303α pCS829, pCS831 P_(TEF1)-Tf6OMT, P_(TEF1)-PsCNMT, P_(TEF1)-Tf4′OMT CSY314 W303α pCS772, pCS831 P_(TEF1)-Tf6OMT, P_(TEF1)-TfCNMT, P_(TEF1)-Tf4′OMT CSY288 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(TEF1)-Ps4′OMT CSY334 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(TEF1)-Tf4′OMT CSY316 W303α his3::P_(GAL1)-Ps6OMT-loxP-KanR, leu2::P_(TEF1)-PsCNMT, ura3::P_(TEF1)-Ps4′OMT CSY317 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(GAL1)-PsCNMT-loxP-URA3, ura3::P_(TEF1)-Ps4′OMT CSY318 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(GAL1)-Ps4′OMT-loxP-LEU2 CSY319 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(GAL1)-Tf4′OMT-loxP-LEU2 CSY325 W303α his3::P_(GAL1)-Ps6OMT-loxP-KanR, leu2::P_(TEF1)-PsCNMT, ura3::P_(TEF1)-Ps4′OMT, gal2::HIS3 CSY326 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(GAL1)-PsCNMT-loxP-URA3, ura3::P_(TEF1)-Ps4′OMT, gal2::HIS3 CSY327 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(GAL1)-Ps4′OMT-loxP-LEU2, gal2::HIS3 CSY328 W303α his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(GAL1)-Tf4′OMT-loxP-LEU2, gal2::HIS3 CSY336 CSY288 pCS1018 P_(TEF1)-PsBBE CSY337 CSY288 pCS1070 P_(TEF1)-PsBBE, P_(TEF1)-TfS9OMT CSY338 CSY334 pCS1018 P_(TEF1)-PsBBE CSY339 CSY334 pCS1070 P_(TEF1)-PsBBE, P_(TEF1)-TfS9OMT CSY399 CSY288 pCS1018, pCS953, P_(TEF1)-PsBBE, P_(TEF1)-TfS9OMT, P_(TEF1)-TfCYP719A, P_(TEF1)-AtATR1 pCS1058 CSY400 CSY334 pCS1018, pCS953, P_(TEF1)-PsBBE, P_(TEF1)-TfS9OMT, P_(TEF1)-TfCYP719A, P_(TEF1)-AtATR1 pCS1058 CSY401 CSY288 pCS1163 P_(TEF1)-PsR7OMT CSY402 CSY334 pCS1163 P_(TEF1)-PsR7OMT CSY409 CSY334 his3::P_(TEF1)-Ps6OMT, leu2::P_(TEF1)-PsCNMT, ura3::P_(TEF1)-Tf4′OMT, trp1::P_(TEF1)- AtATR1(KanR) CSY410 CSY409 pCS1018, pCS953 P_(TEF1)-PsBBE, P_(TEF1)-TfS9OMT, P_(TEF1)-TfCYP719A CSY424 CSY334 pCS782 P_(TEF1)-yCYP2D6 CSY425 CSY409 pCS782 P_(TEF1)-yCYP2D6 CSY426 CSY288 trp1::P_(TEF1)-yCPR1(KanR) CSY427 CSY426 pCS782 P_(TEF1)-yCYP2D6

The promoters driving expression of the heterologous coding sequences can be constitutive promoters or inducible promoters, provided that the promoters can be active in the host cells. The heterologous coding sequences may be expressed from their native promoters, or non-native promoters may be used. Although not a requirement, such promoters should be medium to high strength in the host in which they are used. Promoters may be regulated or constitutive. In one embodiment, promoters that are not glucose repressed, or repressed only mildly by the presence of glucose in the culture medium, should be used. There are numerous suitable promoters, examples of which include promoters of glycolytic genes such as the promoter of the B. subtilis tsr gene (encoding fructose biphosphate aldolase) or GAPDH promoter from yeast S. cerevisiae (coding for glyceraldehyde-phosphate dehydrogenase) (Bitter G. A., Meth. Enzymol. 152:673 684 (1987)). Other strong promoters include the ADHI promoter of baker's yeast (Ruohonen L., et al, J. Biotechnol. 39:193 203 (1995)), the phosphate-starvation induced promoters such as the PHO5 promoter of yeast (Hinnen, A., et al, in Yeast Genetic Engineering, Barr, P. J., et al. eds, Butterworths (1989), and the alkaline phosphatase promoter from B. licheniformis (Lee. J. W. K., et al., J. Gen. Microbiol. 137:1127 1133 (1991)). Some specific examples of yeast promoters include inducible promoters such as Gal1-10, Gal1, GalL, GalS, repressible promoter Met25, tetO, and constitutive promoters such as glyceraldehyde 3-phosphate dehydrogenase promoter (GPD), alcohol dehydrogenase promoter (ADH), translation-elongation factor-1-alpha promoter (TEF), cytochrome c-oxidase promoter (CYC1), MRP7 promoter, etc. Autonomously replicating yeast expression vectors containing promoters inducible by hormones such as glucocorticoids, steroids, and thyroid hormones are also known and include, but are not limited to, the glucorticoid responsive element (GRE) and thyroid hormone responsive element (TRE). These and other examples are described U.S. Pat. No. 7,045,290, which is incorporated by reference, including the references cited therein. Additional vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may be used. Additionally any promoter/enhancer combination (as per the Eukaryotic Promoter Data Base EPDB) could also be used to drive expression of genes. Similarly, one of skill in the art can choose appropriate promoters specific to the host cell, e.g., E. coli. One can also use promoter selection to optimize transcript, and hence, enzyme levels to maximize production while minimizing energy resources.

Vectors useful in the present invention include vectors for use in yeast and other cells. Yeast vectors can be broken up into 4 general categories: integrative vectors (YIp), autonomously replicating high copy-number vectors (YEp), autonomously replicating low copy-number vectors (YCp) and vectors for cloning large fragments (YACs). There are myriad of yeast expression vectors that are commercially available from sources such as, but not limited to, American Type Culture Collection (ATCC, Manassas, Va., USA) and Invitrogen Corp. (Carlsbad, Calif., USA).

Alternatively, insect cells may be used as host cells. In one embodiment, the polypeptides of the invention are expressed using a baculovirus expression system (see, Luckow et al, Bio/Technology, 1988, 6, 47; BACULOVIRUS EXPRESSION VECTORS: A LABORATORY MANUAL, O'Rielly et al. (Eds.), W.H. Freeman and Company, New York, 1992; and U.S. Pat. No. 4,879,236, each of which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ complete baculovirus expression system (Invitrogen) can, for example, be used for production in insect cells.

Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms “transformation” and “transfection” are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al (MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Various selectable markers include those that confer resistance to drugs, such as G418, hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell on the same vector as that encoding the functional enzyme, or its equivalent, or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other cells die).

Similarly, if the host cells are bacterial cells or animal or insect cells, there are a variety of commercially available expression vectors from which to choose. The choice of expression vector system will be apparent to one of skill in the art. One example of a yeast expression vector includes, but is not limited to, p413-TEF, p426-GPD, pCM190, pRS313, pYES2-NT/A, etc.

The choice of yeast plasmids will depend on the host. For Z. rouxii vectors based on the native cryptic plasmids pSR1 (Toh, E. et al., J. Bacteriol 151:1380-1390 (1982)), pSB1, pSB2, pSB3 or pSB4 (Toh-E et al., J. Gen. Microbiol 130:2527-2534 (1984)) may be used. Plasmid pSRT303D (Jeampipatkul, A., et al., Mol. Gen. Genet 206:88-94 (1987)) is an example of useful plasmid vector for Zygosaccharomyces yeast.

Methods of transforming yeast for the purposes of the present invention are well known in the art. Briefly, inserting DNA into yeast can be accomplished with techniques that include but are not limited to, those using spheroplasts, treating with lithium salts and electroporation. The methods are used to insert the heterologous coding sequences into the host cells such that the host cells will functionally express the enzymes or their equivalents and convert the starting/intermediate compounds into the desired end product.

Of course, the present invention also relates to methods of producing BIAs comprising culturing the host cells under conditions suitable for protein production such that the heterologous coding sequences are expressed in the host cells and act upon the starting/intermediate molecules.

In another embodiment of the present invention the host cells may also be used for functional genomics studies in both plant and animals. For example, host cells that are able to convert a given substrate such as norlaudanosoline into reticuline or other downstream BIAs can be used to screen libraries of plant cDNA sequences to discover enzymes which act on the product molecule. Of course, the screening methods can also be applied to precursors of BIAs. The screening methods can be accomplished by cloning a cDNA library from an organism, such as a plant or any organism that produces BIAs or an intermediate thereof, e.g., dopamine, into a suitable expression vector, e.g., a yeast expression vector, and transforming the library of plasmids into the engineered host cells. The standard LiAc/SSD/PEG method can be used. Single colonies can then be grown in liquid culture in the presence of substrate and the growth media or cell extract analyzed by LC-MS. New BIA molecules and the corresponding enzymes catalyzing their production can be identified by chromatogram peaks not present in strains lacking the cDNA library sequence. In vitro or other high-throughput methods can also be used if a suitable assay has been developed for a particular metabolite or byproduct, for example. An additional area of study where these engineered host cells can be employed is in the characterization of the recombinant enzymes known or suspected to be involved in these pathways. In particular, host cells expressing one or more heterologous coding sequences can be grown in the presence of various substrates and the resulting metabolites analyzed by LC-MS or other methods. Both in vivo and in vitro methods can be used in this manner to determine the substrate specificities of these enzymes and possibly discover new catalytic activities.

EXAMPLES Example 1 Construction of Yeast Expression Vectors

Standard molecular biology methods were used to construct the yeast expression vectors. Heterologous coding sequences for the genes of interest were received as plasmids, typically suited for expression in E. coli Coding sequences were either amplified by polymerase chain reaction (PCR) or excised from the vectors if restriction sites were compatible with the destination vector. Briefly, yeast shuttle vectors were constructed based on pCM185 and pCM180 which have an ampicillin resistance marker for maintenance in E. coli, URA and TRP selection markers, respectively, and a centromeric (ARS1/CEN4) origin of replication for yeast. To construct exemplary vectors, the TEF1 promoter was amplified from p413-TEF and the CYC1 promoter from pCM190 and assembled with the NCS coding sequence using PCR methods. Primers for each segment included suitable restriction sites both for cloning into the plasmid backbone and allowing the coding sequence to be easily replaced. This promoter-gene-terminator assembled PCR product was cloned into XhoI/BamHI sites of pCM185. Similar methods were used to make a second DNA insert containing the TEF promoter and 6OMT gene, which was then cloned into PmeI/NotI sites of the previous vector such that a CYC1 terminator for this gene was included on the plasmid backbone. Similar methods were used to construct the analogous vector with a TRP selection marker. In later constructs, the origin of replication was replaced with the 2μ origin using standard cloning procedures to allow for high copy expression in yeast. For cloning and expression of desired enzyme combinations, heterologous coding sequences were cloned into these primary vectors; restriction sites were changed using site-directed mutagenesis if necessary. To remove the second gene from these constructs, vectors were digested with MluI and self-ligated. To remove the first gene from these constructs, the vectors were digested with XhoI and either BamHI or PmeI, ends were blunted using the Klenow enzyme, and self-ligated. Alternatively, a single promoter vector can be made by cloning the gene of interest between the first promoter and second terminator. Analogous vectors with the HIS selection marker were also constructed as needed; for example, to express more than four heterologous coding sequences.

Example 2 Production of Codon-Optimized CYP2D6

The coding sequence for CYP2D6 was optimized based on codon usage in S. cerevisiae as well as RNA secondary structure, using commercially available service providers, such as DNA2.0 Inc. (Menlo Park, Calif., USA). There are other service providers that offer codon-optimized sequences, and some algorithms are available on the world-wide web.

The following is an example of a sequence for yeast codon-optimized CYP2D6 sequence; SalI and NotI restriction sites for cloning are underlined.

1 GTCGACATGG CATTGGAAGC ACTAGTCCCT TTAGCTGTAA TTGTAGCAAT 51 ATTCCTGTTA TTGGTAGACC TTATGCATAG AAGACAAAGA TGGGCTGCAA 101 GATACCCACC CGGCCCACTA CCCTTGCCAG GACTAGGTAA CCTTTTACAT 151 GTTGATTTCC AAAATACTCC GTACTGTTTT GATCAATTGA GGAGAAGATT 201 CGGAGATGTT TTCAGTCTGC AGTTGGCATG GACACCAGTC GTCGTTTTAA 251 ATGGTTTGGC TGCAGTAAGA GAAGCTTTAG TTACGCATGG CGAAGATACG 301 GCGGACAGGC CTCCTGTGCC CATTACACAG ATATTGGGTT TCGGACCTAG 351 ATCTCAGGGT GTATTCCTTG CCCGTTACGG TCCTGCGTGG AGAGAACAGA 401 GAAGGTTTTC TGTATCAACA CTTAGGAATT TGGGTCTAGG CAAGAAATCA 451 TTGGAACAAT GGGTGACCGA GGAAGCCGCT TGTTTGTGCG CAGCCTTTGC 501 TAATCATTCT GGCCGTCCTT TTAGACCTAA TGGATTACTT GATAAAGCAG 551 TATCTAATGT GATTGCCTCC TTAACATGTG GTAGACGTTT TGAGTACGAT 601 GACCCAAGGT TTTTGAGATT GTTAGATCTA GCACAAGAGG GATTAAAGGA 651 AGAAAGTGGT TTCTTGAGAG AGGTTTTGAA TGCTGTTCCA GTGCTATTAC 701 ACATTCCAGC CCTAGCTGGA AAGGTCTTGA GATTTCAAAA GGCTTTCTTA 751 ACGCAGCTTG ATGAGTTACT TACAGAGCAT AGGATGACTT GGGATCCTGC 801 TCAACCCCCG AGAGATCTAA CCGAGGCCTT CCTGGCTGAA ATGGAAAAAG 851 CAAAGGGTAA TCCGGAAAGT TCCTTCAATG ATGAAAACCT GAGAATTGTC 901 GTGGCGGACT TGTTCTCTGC CGGAATGGTG ACAACGTCTA CTACTTTGGC 951 CTGGGGACTT CTATTAATGA TTCTTCATCC AGACGTCCAG AGAAGAGTGC 1001 AACAAGAAAT AGATGATGTG ATAGGACAAG TTAGAAGGCC AGAAATGGGT 1051 GACCAGGCAC ATATGCCATA TACGACTGCT GTAATCCATG AAGTGCAACG 1101 TTTTGGGGAC ATTGTCCCCT TGGGAATGAC CCACATGACT TCTCGTGATA 1151 TTGAAGTACA AGGTTTCAGA ATACCAAAGG GAACTACGCT GATTACGAAT 1201 CTGTCTAGCG TGCTAAAAGA CGAAGCTGTC TGGGAGAAGC CATTTAGGTT 1251 TCATCCAGAA CACTTCTTAG ACGCTCAGGG TCATTTCGTA AAGCCTGAAG 1301 CATTCCTTCC GTTTAGTGCC GGACGTAGGG CGTGTTTGGG TGAACCATTA 1351 GCTAGAATGG AATTATTCCT TTTTTTTACA TCTTTATTGC AGCACTTTTC 1401 ATTTTCTGTT CCGACTGGCC AACCCAGACC TAGCCATCAT GGTGTTTTTG 1451 CTTTCCTAGT TTCTCCCTCT CCTTATGAAT TATGCGCGGT TCCCCGTTGA 1501 GCGGCCGC

The following is an example of human MAO A sequence optimized for expression in yeast and includes 8 nt preceding the start codon, which is underlined:

1 AATTAATAAT GGAAAACCAA GAAAAGGCTT CTATCGCGGG CCACATGTTC 51 GACGTAGTCG TGATCGGAGG TGGCATTTCA GGACTATCTG CTGCCAAACT 101 CTTGACTGAA TATGGCGTTA GTGTTTTGGT TTTAGAAGCT CGGGACAGGG 151 TTGGAGGAAG AACATATACT ATAAGGAATG AGCATGTTGA TTACGTAGAT 201 GTTGGTGGAG CTTATGTGGG ACCAACCCAA AACAGAATCT TACGCTTGTC 251 TAAGGAGCTG GGCATAGAGA CTTACAAAGT GAATGTCAGT GAGCGTCTCG 301 TTCAATATGT CAAGGGGAAA ACATATCCAT TTCGGGGCGC CTTTCCACCA 351 GTATGGAATC CCATTGCATA TTTGGATTAC AATAATCTGT GGAGGACAAT 401 AGATAACATG GGGAAGGAGA TTCCAACTGA TGCACCCTGG GAGGCTCAAC 451 ATGCTGACAA ATGGGACAAA ATGACCATGA AAGAGCTCAT TGACAAAATC 501 TGCTGGACAA AGACTGCTAG GCGGTTTGCT TATCTTTTTG TGAATATCAA 551 TGTGACCTCT GAGCCTCACG AAGTGTCTGC CCTGTGGTTC TTGTGGTATG 601 TGAAGCAGTG CGGGGGCACC ACTCGGATAT TCTCTGTCAC CAATGGTGGC 651 CAGGAACGGA AGTTTGTAGG TGGATCTGGT CAAGTGAGCG AACGGATAAT 701 GGACCTCCTC GGAGACCAAG TGAAGCTGAA CCATCCTGTC ACTCACGTTG 751 ACCAGTCAAG TGACAACATC ATCATAGAGA CGCTGAACCA TGAACATTAT 801 GAGTGCAAAT ACGTAATTAA TGCGATCCCT CCGACCTTGA CTGCCAAGAT 851 TCACTTCAGA CCAGAGCTTC CAGCAGAGAG AAACCAGTTA ATTCAGCGGC 901 TTCCAATGGG AGCTGTCATT AAGTGCATGA TGTATTACAA GGAGGCCTTC 951 TGGAAGAAGA AGGATTACTG TGGCTGCATG ATCATTGAAG ATGAAGATGC 1001 TCCAATTTCA ATAACCTTGG ATGACACCAA GCCAGATGGG TCACTGCCTG 1051 CCATCATGGG CTTCATTCTT GCCCGGAAAG CTGATCGACT TGCTAAGCTA 1101 CATAAGGAAA TAAGGAAGAA GAAAATCTGT GAGCTCTATG CCAAAGTGCT 1151 GGGATCCCAA GAAGCTTTAC ATCCAGTGCA TTATGAAGAG AAGAACTGGT 1201 GTGAGGAGCA GTACTCTGGG GGCTGCTACA CGGCCTACTT CCCTCCTGGG 1251 ATCATGACTC AATATGGAAG GGTGATTCGT CAACCCGTGG GCAGGATTTT 1301 CTTTGCGGGC ACAGAGACTG CCACAAAGTG GAGCGGCTAC ATGGAAGGGG 1351 CAGTTGAGGC TGGAGAACGA GCAGCTAGGG AGGTCTTAAA TGGTCTCGGG 1401 AAGGTGACCG AGAAAGATAT CTGGGTACAA GAACCTGAAT CAAAGGACGT 1451 TCCAGCGGTA GAAATCACCC ACACCTTCTG GGAAAGGAAC CTGCCCTCTG 1501 TTTCTGGCCT GCTGAAGATC ATTGGATTTT CCACATCAGT AACTGCCCTG 1551 GGGTTTGTGC TGTACAAATA CAAGCTCCTG CCACGGTCTT GA

Example 3 Production of Truncated NCS

The T. flavum NCS sequence, courtesy of Peter Facchini, was that of the N-terminal Δ10 truncation. To construct the full-length gene, the first 30 nucleotides (coding for 10 amino acids) were included in the forward primer sequence used for cloning the gene. For other variants, such as the Δ19 N-terminal truncation, the forward primer was designed to amplify the gene beginning at the 20^(th) amino acid and including an additional start codon if the new starting amino acid was not a methionine. To compare expression levels qualitatively, we cloned each variant into a yeast expression vector containing a V5 epitope tag (pYES2-NT/A, Invitrogen), transformed the plasmids into the wild-type yeast strain using the standard LiAc/SSD/PEG method (Gietz, R D and Woods, R A. Methods in Enzymology, Vol 350, pp. 87-96, 2002), and performed Western blot analysis on the total protein lysates. This showed that the T. flavum NCSΔ10 was the most highly expressed in yeast, consistent with E. coli studies.

Example 4 Measurement of Dopamine Production

Yeast strains were constructed to produce dopamine from tyrosine. A high-copy TRP plasmid containing TYDC2 and CYP2D6 both between the TEF1 promoter and CYC1 terminator was tested in various yeast strains. The standard LiAc/SSD/PEG method was used to transform the plasmid(s) into yeast. The CYP2D6 activity is enhanced as evidenced by an increase in dopamine production when the background strain is W(R), which overexpresses CPR1 from the chromosome. An additional increase in dopamine accumulation is observed when cells are co-transformed with a second plasmid expressing additional copies of CPR1 from the tetO₇ promoter. Alterations to the growth media have also been shown to enhance the activity of P450s in yeast (Jiang, H and Morgan, J A. Biotechnology and Bioengineering, Vol. 85, No 2, pp. 130-7). Media containing 3.4 g/L yeast nitrogen base, 5 g/L casein hydrolysate, and 20 g/L glucose was shown to improve dopamine production over standard SC media. For measurement of tyramine and dopamine accumulation, the growth media can be analyzed directly by LC-MS. Intracellular concentrations can be estimated by preparation of cell extracts. Briefly, cells are pelleted at 6000 rpm for 5 min at 4° C. and the supernatant carefully removed. Using a pipette, pellets of the cell paste are dropped into liquid nitrogen and a mortar and pestle used to homogenize the cells. Metabolites are extracted with methanol and solids removed by centrifugation; the liquid is passed through a syringe filter to remove any remaining debris. Appropriate dilutions are made prior to LC-MS analysis using 20 μL injection volume. Samples were run on an Agilent ZORBAX SB-Aq 3×250 mm, 5 μm column with 0.1% acetic acid as solvent A and methanol as solvent B. A gradient elution is used to separate the metabolites of interest: 0-10 min at 100% A, 10-30 min 0-90% B, 30-35 min 90-0% B, followed by a 5 min equilibration at 100% A. Tyrosine, tyramine, dopamine, and L-dopa elute within the first 10 min so that an isocratic elution may be used if analyzing only these and/or similar metabolites. Following LC separation, metabolites are injected into an Agilent 6320 ion trap MSD for detection. Extracted ion chromatograms are used to identify peaks for selected ions and compared to available standards in terms of elution time and MS fingerprint.

Example 5 Measurement of Norcoclaurine Production

Norcoclaurine was produced using both in vivo and in vitro methods. For in vitro experiments, protocols were based on published work (Samanani, N, Liscombe, D K, and Facchini, P. The Plant Journal, Vol. 40, pp. 302-313). Both E. coli and yeast cells expressing NCS variants were lysed with B-PER or Y-PER (Pierce), respectively, and total protein extracts were used in the assay. In vitro reactions were analyzed by LC-MS. Samples were run on an Agilent ZORBAX SB-Aq 3×250 mm, 5 μm column with 0.1% acetic acid as solvent A and methanol as solvent B. A gradient elution is used to separate the metabolites of interest: 0-10 min at 100% A, 10-30 min 0-90% B, 30-35 min 90-0% B, followed by a 5 min equilibration at 100% A. Following LC separation, metabolites are injected into an Agilent 6320 ion trap MSD for detection. Norcoclaurine elutes at 21.2 min using this method. Without a commercially available standard, norcoclaurine is confirmed by its characteristic fragmentation pattern. With the ion trap set to perform MS/MS in the 272 ion, the major fragments of the parent ion of m/z=107 (benzyl) and m/z=161 (isoquinoline) were identified. For in vivo experiments, yeast cells expressing NCS were supplemented with dopamine (between 10 μM and 1 mM) and 4-HPA (custom synthesized from Biosynthesis, concentration undetermined). The above method was used to analyze the growth media directly to detect extracellular norcoclaurine accumulation.

Example 6 Production and Measurement of Reticuline and its Intermediates

For production of reticuline from the substrate norlaudanosoline (or laudanosoline), yeast cells were transformed with plasmids expressing various combinations of 6OMT, CNMT, and 4′OMT coding sequences using the standard LiAc/SSD/PEG method. Yeast cells were grown in SC media lacking uracil and tryptophan for plasmid maintenance. The growth media (SC-URA/-TRP) was supplemented with norlaudanosoline at concentrations between 1-5 mM from a 10 or 20 mM stock solution in water. Cells were grown in test tubes at 30° C. with shaking at 200 rpm; volumes ranged from 1-10 mL and time points were from 8 hrs up to 1 week following addition of substrate. Cells (or an aliquot of culture) were pelleted and the supernatant analyzed directly by LC-MS. Samples were run on an Agilent ZORBAX SB-Aq 3×250 mm, 5 μm column with 0.1% acetic acid as solvent A and methanol as solvent B. A gradient elution is used to separate the metabolites of interest: 0-10 min at 100% A, 10-30 min 0-90% B, 30-35 min 90-0% B, followed by a 5 min equilibration at 100% A. Following LC separation, metabolites are injected into an Agilent 6320 ion trap MSD for detection. Reticuline elutes at 23.6 min with this method and the correct structure of this metabolite is confirmed by performing MS/MS on the 330 ion to produce the fragments m/z=136 and m/z=192. Based on the results from plasmid-based expression, the P. somniferum 6OMT and CNMT were selected with either the P. somniferum or T. flavum 4′OMT as the best enzyme combinations, and these sequences were integrated into the chromosome using homologous recombination. In addition, strains were constructed to test each enzyme individually, typically using a single high-copy plasmid with the TEF promoter driving expression of the coding sequence. For the 6OMT activity, the correct product, 6-O-methyl norlaudanosoline, was detected by LC-MS when norlaudanosoline was present in the growth media; in vitro assays based on published protocols and using yeast lysates were also used to confirm this activity (Ounaroon, A et. A1. The Plant Journal, Vol. 36, pp. 808-19). Yeast cells expressing the CNMT enzyme converted 6,7-dimethyl-1,2,3,4-tetrahydroisoquinoline present at 1 mM in the growth media to the correct N-methylated product in vivo. Yeast cells expressing the 4′OMT enzyme methylated the substrates norlaudanosoline and laudanosoline in vivo. The correct location of the methyl group addition to each substrate is confirmed by performing MS/MS on the selected ion in all cases.

Example 7 Production and Measurement of Downstream Metabolites of Reticuline

For production of metabolites beyond reticuline, yeast strains with chromosomal integrations of 6OMT, CNMT, and 4′OMT were used when possible. These host cells contained no selection markers, allowing for additional coding sequences to be introduced on plasmids. For production of scoulerine, a plasmid expressing BBE was transformed into reticuline-producing strain(s) using the standard LiAc/SSD/PEG method. For production of tetrahydrocolumbamine, plasmids expressing BBE and S9OMT were cotransformed. For production of canadine, plasmids expressing BBE, S9OMT, CYP719A, and ATR1 were contransformed. Construction of a yeast strain to stably express ATR1 along with the reticuline-producing enzymes and transformed with BBE, S9OMT, and CYP719A plasmids showed an increase in CYP719A activity (compared to plasmid-based expression of ATR1) demonstrated by increased conversion of substrate to canadine. Metabolites were detected in the growth media when supplemented with 1 mM or greater norlaudanosoline or laudanosoline. Samples were run on an Agilent ZORBAX SB-Aq 3×250 mm, 5 μm column with 0.1% acetic acid as solvent A and methanol as solvent B. A gradient elution is used to separate the metabolites of interest: 0-10 min at 100% A, 10-30 min 0-90% B, 30-35 min 90-0% B, followed by a 5 min equilibration at 100% A. Following LC separation, metabolites are injected into an Agilent 6320 ion trap MSD for detection. For each metabolite in the pathway, MS/MS was performed and the spectra compared. Based on the patterns observed, it can be confirmed that the peak identified as canadine, for example, has the same molecular structure as its precursor, tetrahydrocolumbamine. For production of salutaridine, the yeast strain stably expressing 6OMT, CNMT, 4′OMT, and ATR1 was transformed with a plasmid expressing CYP2D6. When the growth media was supplemented with norlaudanosoline or laudanosoline, salutaridine was detected by LC-MS. The elution time of salutaridine is identical to that of scoulerine as expected although its fragmentation pattern, particularly the 165 ion, indicates that the structure is in the correct (R) conformation based on the reported fragmentation pattern of salutaridinol 

1. A host cell that produces norcoclaurine, the host cell comprising at least a first, second, third and fourth heterologous coding sequence, wherein the first, second, third and fourth heterologous coding sequences encode a first, second, third and fourth enzyme, respectively, wherein the first, second, third and fourth enzymes are involved in a metabolic pathway that converts tyrosine in the host cell into the norcoclaurine.
 2. The host cell of claim 1, wherein the cell further comprises a fifth heterologous coding sequence, wherein the fifth heterologous coding sequence encodes a fifth enzyme that converts the norcoclaurine into coclaurine.
 3. The host cell of claim 2, wherein the cell further comprises a sixth heterologous coding sequence, wherein the sixth heterologous coding sequence encodes a sixth enzyme that converts the coclaurine into N-methylcoclaurine.
 4. The host cell of claim 3, wherein the cell further comprises a seventh heterologous coding sequence, wherein the seventh heterologous coding sequence encodes a seventh enzyme that converts the methylcoclaurine into 3′-hydroxy-N-methylcoclaurine.
 5. The host cell of claim 4, wherein the cell further comprises an eighth heterologous coding sequence, wherein the eighth heterologous coding sequence encodes an eighth enzyme that converts the 3′-hydroxy-N-methylcoclaurine into reticuline.
 6. The host cell of claim 5, wherein the cell further comprises a ninth heterologous coding sequence, wherein the ninth heterologous coding sequence encodes a ninth enzyme that converts the reticuline into scoulerine, laudanine, or salutaridine.
 7. The host cell of claim 6, wherein the cell further comprises a tenth heterologous coding sequence, wherein the tenth heterologous coding sequence encodes a tenth enzyme that converts the scoulerine into tetrahydrocolumbamine or the salutaridine into salutaridinol.
 8. The host cell of claim 7, wherein the cell further comprises an eleventh heterologous coding sequence, wherein the eleventh heterologous coding sequence encodes an eleventh enzyme that converts the tetrahydrocolumbamine into canadine or the salutaridinol into salutaridinol-7-O-acetate and/or thebaine.
 9. A host cell that produces reticuline from norlaudanosoline, the host cell comprising a first second and third heterologous coding sequence, wherein the first, second and third heterologous coding sequences encode a first, second and third enzyme that are involved in a metabolic pathway that converts the norlaudanosoline into the reticuline.
 10. The host cell of claim 9, wherein the host cell further comprises a fourth heterologous coding sequence, wherein the fourth heterologous coding sequence encodes a fourth enzyme that converts the reticuline into laudanine, scoulerine, or salutaridine.
 11. The host cell of claim 10, wherein the host cell further comprises a fifth heterologous coding sequence, wherein the fifth heterologous coding sequence encodes a fifth enzyme that converts the scoulerine into tetrahydrocolumbamine or the salutaridine into salutaridinol.
 12. The host cell of claim 11, wherein the host cell further comprises a sixth heterologous coding sequence, wherein the sixth heterologous coding sequence encodes a sixth enzyme that converts the tetrahydrocolumbamine into canadine or the salutaridinol into salutaridinol-7-O-acetate and/or thebaine.
 13. A host cell that produces dopamine, the host cell comprising a first and second heterologous coding sequence, wherein the first and second heterologous coding sequences encode a first and second enzyme, respectively, that are involved in a metabolic pathway that converts tyrosine naturally produced in the host cell into the dopamine.
 14. A host cell that produces 4-HPA, the host cell comprising a first and second heterologous coding sequence, wherein the first and second heterologous coding sequences encode a first and second enzyme, respectively, that are involved in a metabolic pathway that converts tyrosine naturally produced in the host cell into the 4-HPA.
 15. A method of preparing a metabolite of tyrosine comprising a) culturing a host cell under conditions suitable for protein production, said host cell comprising a first and second heterologous coding sequence, wherein the first and second heterologous coding sequences encode a first and second enzyme, respectively, that are involved in a metabolic pathway that converts the tyrosine into the metabolite, b) optionally adding tyrosine to the cell culture, and c) recovering the metabolite from the cell culture.
 16. The method of claim 15, wherein the metabolite is dopamine.
 17. The method of claim 15, wherein the metabolite is 4-hydroxyphenylacetaldehyde.
 18. The method of claim 15, wherein the host cell further comprises a third and fourth heterologous coding sequence that encodes a third and fourth enzyme involved in the metabolic pathway the converts tyrosine into the metabolite.
 19. The method of claim 18, wherein the metabolite is norcoclaurine.
 20. The method of claim 19, wherein the host cell further comprises a fifth heterologous coding sequence that encodes a fifth enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 21. The method of claim 20, wherein the metabolite is coclaurine.
 22. The method of claim 21, wherein the host cell further comprises a sixth heterologous coding sequence that encodes a sixth enzyme involved in the metabolic pathway that converts tyrosine in the metabolite.
 23. The method of claim 22, wherein the metabolite is N-methylcoclaurine.
 24. The method of claim 23, wherein the host cell further comprises a seventh heterologous coding sequence that encodes a seventh enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 25. The method of claim 24, wherein the metabolite is 3′-hydroxy-N-methylcoclaurine.
 26. The method of claim 25, wherein the host cell further comprises an eighth heterologous coding sequence that encodes an eighth enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 27. The method of claim 26, wherein the metabolite is reticuline.
 28. The method of claim 27, wherein the host cell further comprises a ninth heterologous coding sequence that encodes a ninth enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 29. The method of claim 28, wherein the metabolite is scoulerine, laudanine, or salutaridine.
 30. The method of claim 29, wherein the host cell further comprises a tenth heterologous coding sequence that encodes a tenth enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 31. The method of claim 30, wherein the metabolite is tetrahydrocolumbamine or salutaridinol.
 32. The method of claim 31, wherein the host cell further comprises an eleventh heterologous coding sequence that encodes an eleventh enzyme involved in the metabolic pathway that converts tyrosine into the metabolite.
 33. The method of claim 32, wherein the metabolite is canadine or salutaridinol-7-O-acetate and/or thebaine.
 34. A method of preparing a metabolite of norlaudanosoline comprising a) culturing a host cell under conditions suitable for protein production, said host cell comprising a first and second heterologous coding sequence, wherein the first and second heterologous coding sequences encode a first and second enzyme, respectively, that are involved in a metabolic pathway that converts the norlaudanosoline into the metabolite, b) adding norlaudanosoline to the cell culture, and c) recovering the metabolite from the cell culture.
 35. The method of claim 34, wherein the host cell further comprises a third heterologous coding sequence that encodes a third enzyme involved in the metabolic pathway the converts norlaudanosoline into the metabolite.
 36. The method of claim 35, wherein the host cell further comprises a fourth heterologous coding sequence that encodes a fourth enzyme involved in the metabolic pathway the converts norlaudanosoline into the metabolite.
 37. A method of screening a nucleotide library for at least one nucleotide sequence that codes for at least one peptide with enzymatic activity directed towards benzylisoquinolines, the method comprising a) providing a nucleic acid library to a culture of host cells, the host cells being engineered to produce a first benzylisoquinoline and the library being constructed for peptide expression in the host cell, b) optionally providing a starting material to the host cells, c) culturing the host cells under conditions appropriate for peptide expression, d) determining the presence or absence of a second benzylisoquinoline, wherein the presence of a second benzylisoquinoline indicates the presence of at least one nucleotide sequence in the nucleotide library that codes for at least one peptide with enzymatic activity directed towards benzylisoquinolines. 